PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID MDP0000215349
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Rosaceae; Maloideae; Maleae; Malus
Family WRKY
Protein Properties Length: 611 aa    MW: 67370.7 Da    pI: 6.5714
Description WRKY family protein
Gene Model
Gene Model ID  Type    Source
MDP0000215349  genome  GDR
Signature Domain
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    WRKY    41.5   2.6e-13  143    182  23         59
                    EEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
           WRKY  23 sYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59 
                    sYYrCt++   gC+++k+ver+ +dp ++++tY+g H ++
  MDP0000215349 143 SYYRCTYRftqGCRATKQVERDPDDPAMLTVTYKGVHVCR 182
                    9******9999**************************885 PP
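The table coordinates can be cross-checked against the alignment itself. The following is a minimal sketch, assuming the HMMER-style convention that `.` in the model line marks an insertion relative to the HMM and `-` in the query line would mark a deletion. The helper name `consumed` is illustrative and not part of PlantTFDB.

```python
# Aligned strings copied from the WRKY alignment above.
hmm_line = "sYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe"
query_line = "SYYRCTYRftqGCRATKQVERDPDDPAMLTVTYKGVHVCR"

# Coordinates reported in the Signature Domain table.
hmm_start, hmm_end = 23, 59
query_start, query_end = 143, 182


def consumed(aligned: str, gap_chars: str) -> int:
    """Count alignment columns that consume a position (non-gap columns)."""
    return sum(1 for ch in aligned if ch not in gap_chars)


# Assumption: '.' pads the model line where the query has inserted residues,
# and '-' would pad the query line where model positions are skipped.
hmm_positions = consumed(hmm_line, ".")
query_residues = consumed(query_line, "-")
insertions = hmm_line.count(".")

print("HMM positions covered:", hmm_positions)
print("Query residues covered:", query_residues)
print("Inserted query residues:", insertions)
print("HMM range consistent:", hmm_positions == hmm_end - hmm_start + 1)
print("Query range consistent:", query_residues == query_end - query_start + 1)
```

Under that assumption, the three lowercase query residues (`ftq`) are insertions. This would explain why the query range spans more residues than the HMM range.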

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
SMART            SM00774            5.7E-12   139    183  IPR003657    WRKY domain
SuperFamily      SSF118290          3.27E-11  141    182  IPR003657    WRKY domain
Gene3D           G3DSA:2.20.25.80   2.9E-12   142    182  IPR003657    WRKY domain
Pfam             PF03106            2.2E-11   142    181  IPR003657    WRKY domain
PROSITE profile  PS50811            12.254    143    179  IPR003657    WRKY domain
SuperFamily      SSF50978           9.89E-54  363    610  IPR017986    WD40-repeat-containing domain
Gene3D           G3DSA:2.130.10.10  2.4E-57   363    611  IPR015943    WD40/YVTN repeat-like-containing domain
SMART            SM00320            2.0E-5    365    404  IPR001680    WD40 repeat
CDD              cd00200            4.15E-48  368    610  No hit       No description
PROSITE profile  PS50294            33.59     372    540  IPR017986    WD40-repeat-containing domain
PROSITE profile  PS50082            9.673     372    403  IPR001680    WD40 repeat
Pfam             PF00400            0.0018    373    403  IPR001680    WD40 repeat
SMART            SM00320            6.3E-10   407    446  IPR001680    WD40 repeat
Pfam             PF00400            6.5E-7    412    446  IPR001680    WD40 repeat
PROSITE profile  PS50082            14.084    414    449  IPR001680    WD40 repeat
PROSITE pattern  PS00678            0         433    447  IPR019775    WD40 repeat, conserved site
SMART            SM00320            2.3E-7    450    490  IPR001680    WD40 repeat
Pfam             PF00400            3.1E-4    454    490  IPR001680    WD40 repeat
PROSITE profile  PS50082            12.012    457    494  IPR001680    WD40 repeat
SMART            SM00320            4.8E-8    493    531  IPR001680    WD40 repeat
Pfam             PF00400            2.0E-5    496    530  IPR001680    WD40 repeat
PROSITE profile  PS50082            9.205     500    540  IPR001680    WD40 repeat
SMART            SM00320            0.04      572    611  IPR001680    WD40 repeat
PROSITE profile  PS50294            10.364    579    611  IPR017986    WD40-repeat-containing domain
PROSITE profile  PS50082            10.776    579    611  IPR001680    WD40 repeat
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0005515  Molecular Function  protein binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 611 aa
MKTKIXRXYE GTFIREIIAG ITVGISSSEL QRLDSPKLED IDSTEHTDIA SSVSSASSPF  60
SLAPLKKTHH EPDRSDGTSR KPSEIGKRVD SNNSTRFHLM DPVCLGSTGY VAGPLIVVAC  120
VDANTISCAC KVRVPADRSE DGSYYRCTYR FTQGCRATKQ VERDPDDPAM LTVTYKGVHV  180
CRAATGFVEG RDKIQISGLE QLHCGLLQQP KPFIQAPQPF HQLQMLTPQH QQLMLAQQNM  240
TSPSAANDES RDTDRLTKLK MAQLQQQQNS NPQQQQQQQQ QQLQQHALSN QQSQNSNLNP  300
HQQDKMGGAG SITMDGSMSN SFRGNDPDGS VGDNVESFLS PDDVDRRDAV GRCMDVSKGN  360
CIWFTFTEVN SIKASASKVT SCHFSSDGKF LASGGHDKXA VLWYTDTLKL KCTLQEHSAL  420
ITDVRFSPSK PCLATSSFDK TIRVWDADNP XYSLRXFXGH SASVMSLDFH PNKDDLICSC  480
DGDGQIRYWS INNGCCSCVS KGHTKPIHSV CWDPSGEFLA SVSEDSVRVW TLGAGSEVEC  540
VHEFSCIGNK FHSCVFHPTY TSLLTLELWN MTENKTMTXP AHEGLIDSLA VSTVTGLISS  600
ASHDTFVKIW K
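The blocked sequence listing can be turned back into a plain string and checked against the length and domain coordinates reported in this record. This is a minimal sketch, assuming the listing uses space-separated blocks with trailing position counters and that coordinates are 1-based and inclusive. The function names `parse_blocked_sequence` and `region` are illustrative, not a PlantTFDB API.

```python
import re

# Sequence listing copied from this record (blocks of 10, counters at line ends).
raw = """\
MKTKIXRXYE GTFIREIIAG ITVGISSSEL QRLDSPKLED IDSTEHTDIA SSVSSASSPF  60
SLAPLKKTHH EPDRSDGTSR KPSEIGKRVD SNNSTRFHLM DPVCLGSTGY VAGPLIVVAC  120
VDANTISCAC KVRVPADRSE DGSYYRCTYR FTQGCRATKQ VERDPDDPAM LTVTYKGVHV  180
CRAATGFVEG RDKIQISGLE QLHCGLLQQP KPFIQAPQPF HQLQMLTPQH QQLMLAQQNM  240
TSPSAANDES RDTDRLTKLK MAQLQQQQNS NPQQQQQQQQ QQLQQHALSN QQSQNSNLNP  300
HQQDKMGGAG SITMDGSMSN SFRGNDPDGS VGDNVESFLS PDDVDRRDAV GRCMDVSKGN  360
CIWFTFTEVN SIKASASKVT SCHFSSDGKF LASGGHDKXA VLWYTDTLKL KCTLQEHSAL  420
ITDVRFSPSK PCLATSSFDK TIRVWDADNP XYSLRXFXGH SASVMSLDFH PNKDDLICSC  480
DGDGQIRYWS INNGCCSCVS KGHTKPIHSV CWDPSGEFLA SVSEDSVRVW TLGAGSEVEC  540
VHEFSCIGNK FHSCVFHPTY TSLLTLELWN MTENKTMTXP AHEGLIDSLA VSTVTGLISS  600
ASHDTFVKIW K
"""


def parse_blocked_sequence(text: str) -> str:
    """Drop spaces, newlines and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", text)


def region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, treating coordinates as 1-based and inclusive."""
    return sequence[start - 1:end]


seq = parse_blocked_sequence(raw)
wrky = region(seq, 143, 182)       # WRKY signature domain coordinates
unknown = seq.count("X")           # residues reported as X (unknown) in the model

print("Length:", len(seq))
print("WRKY region:", wrky)
print("Unknown residues (X):", unknown)
```

The parsed length should agree with the 611 aa stated above, and the extracted region should match the query line of the WRKY alignment once that line is uppercased.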
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
2ymu_A  2e-21   368          610        6          576      WD-40 REPEAT PROTEIN
2ymu_B  2e-21   368          610        6          576      WD-40 REPEAT PROTEIN
Expression -- UniGene
UniGene ID  E-value  Expressed in
Mdo.30199   1e-77    leaf
Annotation -- Protein
Source     Hit ID           E-value  Description
Refseq     XP_009377500.1   0.0      PREDICTED: transcriptional corepressor LEUNIG isoform X6
Swissprot  Q9FUY2           1e-142   LEUNG_ARATH; Transcriptional corepressor LEUNIG
TrEMBL     A0A0A0LM70       1e-168   A0A0A0LM70_CUCSA; Uncharacterized protein
STRING     GLYMA14G16040.1  1e-150   (Glycine max)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G11070.2  6e-11    WRKY family protein